SELF-INTERSECTIONS OF RANDOM GEODESICS ON NEGATIVELY CURVED 

SURFACES 



STEVEN P. LALLEY 

ABSTRACT. We study the fluctuations of self-intersection counts of random geodesic segments of 
length t on a compact, negatively curved surface in the limit of large t. If the initial direction vec- 
tor of the geodesic is chosen according to the Liouville measure, then it is not difficult to show that the 
number N(t) of self-intersections by time t grows like nt 2 , where k = km is a positive constant de- 
pending on the surface M. We show that (for a smooth modification of N(t)) the fluctuations are of 
size t, and the limit distribution is a weak limit of Gaussian quadratic forms. We also show that the 
fluctuations of localized self-intersection counts (that is, only self-intersections in a fixed subset of M 
are counted) are typically of size t 3 ' 2 , and the limit distribution is Gaussian. 



1. Fluctuations of Self-intersection Counts 

Choose a point x and a direction 9 at random on a compact, negatively curved surface M, and 
let j(t) = y(t;x, 9) be the (unit speed) geodesic ray in direction 9 started at x. For large t the 
number N(t) — N(-/[0,t}) of transversa^ self -intersections of the geodesic segment 7[0,i] will be 
of order t 2 ; in fact, if Km = (2n\M |) , where \M\ denotes the surface area of M, then 

(1) lim N{t)/t 2 = k m jl 

t — >oo 

with probability 1. See section|3]below for the (easy) proof. Furthermore, the empirical distribution 
of the self-intersection points converges to the uniform distribution on the surface. Similar results 
hold for a randomly chosen closed geodesic [ 9 ] : If from among all closed geodesies of length < 
L one is chosen at random, then the number of self-intersections, normalized by L 2 , will, with 
probability approaching one as L — > oo, be close to K g . These results have been extended [12J to the 
number and distribution of self -intersections at angles in fixed intervals [a, (3\. Closed geodesies 
with no self- intersections have long been of interest in geometry — see, for instance, 012) — and 
recently, M. Mirzakhani [11] found the asymptotic growth rate of the number of simple closed 
geodesies of length < t as t — » oo. That this number is not shows (in view of the Law of Large 
Numbers {D ) that there is substantial variation in the random variable N(t). 

The primary objective of this paper is to investigate the fluctuations (second-order asymptotics) 
of the self-intersection numbers. One's first guess might be that these are of order t, and this is in- 
deed the case; however, lest this seem too obvious we add that if one counts only self -intersections 
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in a (nice) sub-domain U C M then the fluctuations are no longer of order t, but rather i 3 / 2 . (In fact 
we will only prove these statements for smoothed versions of the counts.) One might also guess 
that the rescaled random variable (N(t) — K g t 2 )/t should converge in distribution as t — > oo to a 
Gaussian distribution, but this, as we will show, is probably false: the limit distribution is a weak 
limit of Gaussian quadratic forms. (This does not preclude the possibility that it is Gaussian, but 
the arguments below will make it clear that this is unlikely.) Weak limits of Gaussian quadratic 
forms are known to occur in connection with stationary processes exhibiting long-range depen- 
dence [15[, [10[ (where they are known as Rosenblatt distributions), and also in the connection with 
certain types of [/—statistics llTil . 

Definition 1. A Gaussian quadratic form is a random variable (or its distribution) of the form 
2j=i OjZj, where the random variables Zj are independent, unit Gaussians and 9j are real scalars. 

Unfortunately, the study of fluctuations in the self-intersection numbers N(t) is complicated by 
the (infrequent) occurrence of self-intersections at very small angles. We have not yet been able to 
successfully resolve the technical issues created by such self-intersections, and so we will state our 
main result not for the counts N(t) but rather for a smoothed version N v (t) defined as follows. Let 
ip : R — > [0, oo) be an even, C°° , nonnegative, 2tt— periodic function. For each self-intersection i of 
the geodesic segment 7(0, t], denote by (9, ; e [0, 2tt) the angle of the self-intersection: in particular, if 
the self-intersection occurs at times < < ij < t, then 9i is the angle between the tangent vector 
to 7 at Si and the tangent vector at tj. Define the smoothed self-intersection number 

N(t) 

(2) N v (t)=N v {7[0,t]) = '£ l ip(9 i ). 

i=l 

Clearly, if <p = 1 then N v (t) = N(t). We will prove in section [3] that the smoothed self-intersection 
numbers satisfy a strong law of large numbers (SLLN) analogous to Q): 

(3) lim N v {t)/t 2 = k v /2 

t—>oo 

where 

( 4 ) Kv = s$M\f\w\<**e\dB. 

Theorem 1. Assume that the smoothing function tp is zero in a neighborhood [—a, a] of and positive in 
(a, 7r — a). If the initial point x and direction 6 are chosen randomly according to the Liouville measure, 
then ast^oo, 

(5 ) ^ffl-M' ^g 

where =^ indicates weak convergence to a distribution * in the weak closure of the set of Gaussian quadratic 
forms. This limiting distribution may depend on both the surface M and the smoothing function (p. 

We have not been able to prove that the limit distribution \1/ is nondegenerate, nor that it is 
non-Gaussian, but this seems unlikely (see Section l4~T1 below). It is naturally of interest to consider 
also fluctuations in the empirical distribution of self-intersection points. Let / : M — > R be any 
continuous function on the surface M. For each self- intersection i of the geodesic segment 7[0, t], 
denote by $i € [— n, ir] the angle and Xi € M the location of the self-intersection. Define the 
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/—localized self-intersection counts 

N(t) 

(6) N v . f (t) = i\W( 7 [0, t}) = £ f(*iM8i)- 

i=l 

As for the global self -intersection counts, the localized self -intersection random variables N^-jit) 
obey a strong law of large numbers: For vl — almost every initial direction vector, 



(7) 



lim N v;f (t)/t 2 = ^ [ f{x) dx := A v;f 
t->oo 2 J M 



where dx indicates the normalized surface area measure on the surface M . See section l5Tl for the 
proof. What is interesting about the localized self-intersection counts is that their fluctuations are 
of a different order of magnitude than those of the global variables N v (t), at least for functions / 
of small support: 

Theorem 2. For any compact, negatively curved surface M there is a constant e > such that the following 
is true: If the smoothing function ip satisfies the hypotheses of Theorem^ if f : M —* E is any C°°, 
nonnegative function that is not identically and whose support has diameter < e; and if the initial direction 
vector 7(0) is chosen randomly according to the Liouville measure, then for some a — a v -j > 0, 

(8) i\W(t)~^ 

ast—>oo, where $ is the standard (mean 0, variance 1) Gaussian distribution. 

Simple lower bounds for e > will be given in Section |5j where Theorem [2] will be proved. 
Theorem Q] will be proved in section 2J using an extension of a mixing result of Dolgopyat [ 5 1 . 
The proof relies on a representation of the self-intersection counts in terms of what we dub the 
intersection kernels for the geodesic flow; these are studied in section 12 For completeness, we 
present proofs of the strong laws of large numbers CD and © in section|3l and of (01 in section l5Tl . 



2. Intersection Kernel 

2.1. The intersection kernel. Geodesies on any surfac^ regardless of its curvature, look locally 
like straight lines. Consequently, for any compact surface M with smooth Riemannian metric there 
exists q > such that if a and (3 are geodesic segments of length < g then a and (3 intersect 
transversally if at all, in at most one point. Thus, if 7 is not periodic then the smoothed self- 
intersection number N V (L) = iV v (7[0, L]) can be computed by partitioning 7 [0, L] into nonover- 
lapping segments of common length S < g and counting the number of pairs that intersect transver- 
sally. Let ai and ctj be two such segments; then the event that these segments intersect is com- 
pletely determined by their initial points and directions, as is the angle of intersection. 

Definition 2. The intersection kernel Hg : SM x SM — * R + is the nonnegative function that takes 
the value Hs(u, v) — ip(9) if the geodesic segments of length 5 with initial tangent vectors u and v 
intersect transversally at angle 9, and Hs(u, v) = otherwise. 



Here and throughout the paper the term geodesic will be used either to indicate a geodesic path in the surface M or 
its lift to the unit tangent bundle SAI; the meaning should be clear from context. Two geodesic segments will be said to 
intersect if their projections to the surface M intersect. 
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The dependence on the smoothing function ip is suppressed, as <p will be fixed throughout the 
paper. Because tp is assumed to be even and it— periodic, Hg is symmetric in its arguments u, v. 
The intersection kernel determines the smoothed self-intersection numbers as follows: If L = n5 
is an integer multiple of S, then for any geodesic 7, 

n n 

(9) N V (L) = N v fr[0,L]) =2EE^W^),7(^)). 

»=i j=i 

The factor of 1/2 compensates for the double-counting that results from letting both indices of 
summation i,j range over all n geodesic segments. Note that the diagonal terms Hs(~f(i5), j(i8)) 
in this sum are all 0, because the segment ^(iS) does not intersect itself transversally. 

2.2. The associated integral operators. The intersection kernel H$(u,v) is symmetric in its ar- 
guments and Borel measurable, but not continuous, because self-intersections can be created or 
destroyed by small perturbations of the initial vectors u, v. Nevertheless, H s induces a self-adjoint 
integral operator on the Hilbert space L 2 (v L ) by 

(10) H s ^(u)= f H s (u,v)iP{v)dv L {v). 

JveSM 

Lemma 1. For all sufficiently small 8 > 0, 

(11) H s l(u) := J H 5 (u,v)du L (v) = 8 2 k v 

for all u e M. Thus, the constant function 1 is an eigenfunction of the operator Hs, and consequently the 
normalized kernel Hg(u, v)/5 2 n v is a Markov kernel. 

Remark 1. The integral Hs 1 (u) is the expectation of (p(9) where 6 is the angle in which a randomly 
chosen geodesic segment of length S intersects the geodesic segment of length 5 with initial tangent 
vector u. The assertion of the lemma is that this expectation does not depend on the initial tangent 
vector u, even for surfaces M of variable negative curvature. 

Proof. Denote by a = 7( [0, 8] ; u) the geodesic segment of length 5 with initial tangent vector u. For 
small S > and fixed angle 9, the set of points x £ S such that a geodesic segment of length 5 with 
initial base point x intersects a at angle 6 is approximately a rhombus of side 6 with an interior 
angle 9. The area of such a rhombus is 5 2 \ sm0\. Hence, as 5 — > 0, 

(12) J H g (u,v)du L (v) - 5 2 v{9)\sm9\d9/{2ir\M\)=5 2 K lfn 

and the relation <~ holds uniformly for u e SM. 

It remains to show that the approximate equality <~ is actually an equality for small 5 > 0, 
equivalently, that the value of the integral stabilizes as 5 — > 0. Assume that S > is sufficiently 
small that any two distinct geodesic segments of length 5 intersect transversally at most once. 
Consider the geodesic segments of length 5 with initial direction vectors u and v. For any integer 
m > 2, each of these segments can be partitioned into m nonoverlapping sub-segments (each 
open on one end and closed on the other) of length S/m. At most one pair of these constituent 
sub-segments can intersect; hence, 

rn — 1 m— 1 

H s (u,v) =^2Y1 H s/m("/(iS;u),j(jS;v)). 

i=0 7=0 



SELF-INTERSECTIONS OF RANDOM GEODESICS 



5 



Integrating over v with respect to the Liouville measure vl and using the invariance of Vl relative 
to the geodesic flow we obtain that 

m—1 

H S l(u) = ^ mH S/rnHl(i$;u)). 
i=0 

Let m — > oo and use the approximation (p~2t (with 6 replaced by S/m); since this approximation 
holds uniformly, it follows that Hgl(u) = S 2 ks- D 

Lemma 2. For each 5 > sufficiently small, the integral operator Hg on L 2 (vif) is compact. 

Proof. If the kernel Hg (u, v) were jointly continuous in its arguments u, v then this would follow by 
standard results about integral operators — see, e.g., [16|. Since Hg is not continuous, these stan- 
dard results do not apply; nevertheless, the argument for compactness is elementary. It suffices to 
show that the mapping u ^ Hg(u, ■) is continuous relative to the L 2 — norm. Take u, u' £ SM, and 
let a, a 1 be the geodesic segments of length 5 started at u, u' , respectively. If u, u' are close, then 
the geodesic segments a, a' are also close. Hence, for all but very small angles 6 the set of points 
x £ M such that a geodesic segment of length 5 with initial base point x intersects a at angle 6 but 
does not intersect a' is small. Consequently, the functions Hs(u, •) and Hg(u', ■) differ on a set of 
small measure. □ 

Lemma |2] implies that the Hilbert-Schmidt theory applies. In particular, the non-zero spec- 
trum of Hg consists of isolated real eigenvalues Xj of finite multiplicity (and listed according to 
multiplicity). The corresponding (real) eigenf unctions ijjj can be chosen so as to consititute an 
orthonormal basis of L 2 (vif), and the eigenvalue sequence Xj is square-summable. 

Lemma 3. The Markov kernel Hg := Hg/S 2 K V satisfies the Doeblin condition: There exist an integer 
n > 1 and a positive real number e such that 

(13) H£(u, v)>e for all u, v e SM. 

Proof. Chose n so large that for any two points x,y G S there is a sequence {iEi}o<i<n of n + 1 
points beginning with xq = x and ending at x n = y, and such that each successive pair Xi, a^+i are 
at distance < 6/4. Then for any two geodesic segments a, p of length 6 on S there is a chain of n+ 1 
geodesic segments a„ all of length 8, beginning at ao = ct and ending at a n = P, such that any two 
successive segments on and aj+i intersect transversally. Since the intersections are transversal, the 
initial points and directions of these segments can be jiggled slightly without undoing any of the 
transversal intersections. This implies lfl3)l . □ 

Corollary 1. The eigenvalue S 2 n v is a simple eigenvalue of the integral operator Hg, and the rest of the 
spectrum lies in a disk of radius < 5 2 n v . 

Proof. This is a standard result in the theory of Markov operators. □ 

Corollary 2. For every j > 2 the eigenfunction ipj has mean zero relative tov^, and distinct eigenfunctions 
are uncorrelated. 

Proof. The spectral theorem guarantees orthogonality of the eigenfunctions. The key point is that 
tpi = 1 is an eigenfunction, and so the orthogonality ipj _L ipi implies that each ipj for j > 2 has 
mean zero. □ 
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Lemma 4. If 5 > is sufficiently small then Hs has eigenvalues other than and Xx(5). 

Proof. Otherwise, the Markov operator Hs would be a projection operator: for every ip S l?{yif) 
the function Hgip would be constant. But if <5 > is small, this is obviously not the case. □ 

2.3. Smoothing. The discontinuity of the kernel Hs creates certain technical problems: for in- 
stance, the eigenfunctions ipj need not be continuous. Thus, it will be to our advantage to approx- 
imate Hs by a smooth kernel Ks in such a way that the sums (0 are not too badly disturbed when 
Hs is replaced by the approximation Ks . It is solely for the purpose of constructing this approxima- 
tion that the restrictions in Theorem [T] on the smoothing kernel tp — in particular, that it vanishes 
in a neighborhood of — are needed. 

Let p(s) be an even, C°° probability density on R with support contained in the interval [—1,1] 
Define V to be the set of pairs (u, v) e SM x SM such that the geodesies through u and v cross, if 
at all, transversally; this set is dense in SM x SM. For any pair (u, v) € V, the (two-sided) geodesies 
{j(s; w)} s >o and { 7 (i; v)} t >o will intersect in at most countably many points, which can be labeled 
(si,U) where the entries s, and U denote the signed distances along the two geodesies from their 
origins u, v where the intersection occurs. (The intersections do not generally occur in the same 
(time) order along the two geodesies.) Let 9i € (0, it) be the angle of crossing at the ith intersection, 
and define 

(14) K s (u, v)=J2 5- 2 P {s l /5)p{t l /8) V {e i ). 

i 

Lemma 5. Assume that the smoothing function ip is zero in a neighborhood [—a, a] of 0. If 5 > is 
sufficiently small then for any pair (u, v) e T>, the sum {Ml contains at most one nonzero term, so Ks(u, v) 
is well-defined and finite. Furthermore, the function Ks(u,v) extends to a C°°, symmetric function on 
SM x SM, by setting K s (u, v) = Ofor all (u, v) £ V. 

Proof. Since ip is even, the kernel Ks is symmetric if the sum ((Ml is finite. Now in order that the ith 
term of the sum (fL4b be nonzero, the distances \si \ and \tj\ must both be smaller than 8, because p 
has support contained in [—1,1]. But if 5 > is sufficiently small then any two geodesic segments 
of length 28 will intersect transversally at most once. Thus, the sum ((T4)l has at most one nonzero 
term. 

Since geodesies vary smoothly with their initial conditions, the intersection distances Si , ti and 
angles 6^ vary smoothly with u, v in T>. Consequently, the function Ks is C°° in T>. But as (u, v) 
approaches the boundary of T>, the angle(s) of intersection of the geodesies through u and v must 
approach zero. Hence, by the assumption on tp, the kernel Ks vanishes near dT>. □ 

Lemma 6. For any geodesic 7, the smoothed self-intersection number N<p(i) of the segment j[0,t] satisfies 
the inequalities 

(15) Iff ^ 4 ( 7 (r), 7 ( S ))drd S <^(t)<i ff K 5 (~f(r), 7 (s)) dr ds. 

Z JJ[S,t-S] 2 Z JJ[-SA+8] 2 

Proof. Suppose that 7 has a self-intersection at some (r, s), that is, the vectors 7 (r) and j(s) lie over 
the same base point in M. Then by definition of Ks, since h is a probability density supported 
by [—8/2, 8/2], the integral of Ks(j(r') : j(s')) over the square of side 8 centered at (r, s) must be 
1. Hence, both bounding integrals in Jl5] | count each such self -intersection (r, s) with weight 1. 
The only self-intersections not counted correctly are those (r, s) where either r or s lies within 8 of 
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one of the time endpoints or t. Adjusting the limits of integration by ±<5 compensates for these 
boundary errors. □ 

Remark 2. The inequalities (fl5|) imply that the double integral on the right side of <TT5t is bounded 
above by i\L( 7 [— 5, t + S]), and the double integral on the left is bounded below by AL( 7 [<5, t — S]). 
Therefore, the errors in the inequalities JT5t are no larger than 

(16) N^-S^ + S^-N^^t-S]). 

□ 

The smoothed kernels Ks( u , v ) enjoy all of the properties enumerated for the intersection ker- 
nels H$(u,v) in sec. 12.21 above, provided the smoothing window S > is sufficiently small. In 
particular, 

(PI) The constant function 1 is an eigenfunction of Kg, with eigenvalue k v . 
(P2) The integral operator Kg is compact. 

(P3) The Markov kernel Kg j k v satisfies the Doeblin condition. 

(P4) The eigenvalue k v is simple, and the rest of the spectrum lies in some [— k v + e, k v — e]. 
(P5) Distinct eigenfunctions are uncorrelated, and except for the constant eigenfunction have 
mean 0. 

These may all be proved by mimicking the proofs of the analogous assertions for the self- 
intersection kernels H$. In the special case that the smoothing density p in the definition {14[| 
is not only even but also nondecreasing on (— oo, 0], property (PI) can be deduced directly from 
Lemma [TJ because in this case the kernel K$ is a convex combination of the kernels H e /e 2 , where 
< e < S. Only Property (PI) will be needed for the proof of Theorem[TJ and we will have no need 
to consider smoothing densities p that are not monotone on (— oo, 0], so we refrain from spelling 
out the details of the proofs of (PI) — (P5) in the general case. 



3. SLLN for Self-Intersections 

3.1. SLLN for the smoothed self-intersection numbers. According to Lemmas|5r[6l if the smooth- 
ing function ip is zero in a neighborhood [—a, a] of 0, then the intersection kernel H$ can be ap- 
proximated by a continuous kernel K$ in such a way that the smoothed self-intersection number 
N v (t) is well-approximated by 

N*(l[0,t]) := \ ff jr tf ( 7 (r),7(»))drd*. 
1 J J[0,t] 

For the strong law of large numbers ©, only a crude bound on the error in this approximation is 
needed. By Remark[2j the error is bounded by ^16) : and by another use of the double inequalities 
((T5l l, it follows that the discrepancy is bounded by the difference N*('y[—5, t + S]) — N*(j[S, t — 5]). 
Consequently, since 5 > can be chosen arbitrarily small, to prove the strong law of large numbers 
(|3) it suffices to show that for S > sufficiently small and for Vl,— almost every initial point 7(0), 

(17) lim t~ 2 Ks(j(si),^(s 2 )) dsids 2 = k v 

iJ[0,(p 
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The following proposition implies that for any probability measure p on SM that is invariant and 
ergodic under the geodesic flow, for /j— almost every initial point 7(0), 

(18) limt~ 2 // K s (^(s 1 ),j(s 2 ))dsids2 = // K s (x,y) dp(x)dp(y). 

JJ[0,t] 2 JJSMxSM 

That the expectation on the right equals k v for p = Vl follows from property (PI) above. 

Proposition 1. Let (X, d) be a compact metric space and let K : X 2 — > K be continuous. If p is a 
Borel probability measure on X and T : X — > X is an ergodic, measure-preserving transformation (not 
necessarily continuous) relative to p, then 

(19) lim -L V V K{T*x, T j x) = ff K(y, z) dp(y)dp(z 
for (j,— almost every x. 



Proof. The function K is bounded, since it is continuous, so the double integral in {19) is well- 
defined and finite. Furthermore, the set of functions K x defined by K x (y) := K(x,y), where x 
ranges over the space X, is equicontinuous, and the function 



K x ■= / K x (y)dp(y) 
J x 

is continuous in x. The equicontinuity of the functions K x implies, by the Arzela-Ascoli theorem, 
that for any e > there is a finite subset F £ = {xi}i<i<i such that for any x € SM there is at least 
one index i < I such that 

\\K X - K Xi \\oo < e. 

It follows that the time average of K x along any trajectory differs from the corresponding time 
average of K Xi by less than e. But since the set F £ is finite, Birkhoff 's ergodic theorem implies that 
if the initial point and direction of 7 are chosen randomly according to then with probability 
one, for each Xi S F s , 



1 



lim - / K(xi,j(s))ds = / K(x il y) dv L (y). 
*-* 00 t Jo J 
It therefore follows from equicontinuity (let e — > 0) and the continuity in x of the averages K x that 
almost surely 

lim - / K(x,'y(s))ds= [ K{x,y)dv L {y) 

t^oc t J J 

uniformly for x € X. The uniformity of this convergence guarantees that (191 holds (i— almost 
surely. □ 

Remark 3. Wiener's multi-parameter ergodic theorem ([17], Theorems I" - II") implies that under 
much weaker hypothese^l on the function K, 

lim — g y~] y~] K(T l x, T j y) = \\ K(u,v) d^j,(u)d(i(v) for (p, x p) — every (x, y). 

Proposition [1] cannot be deduced from this, as the diagonal of X x X has (p x p)— measure 0. 
In fact, it is not generally true that the convergence | |T9t holds for functions K(x,y) that are not 



^Wiener requires that K 6 L 2 (fi x y) . For still weaker hypotheses, see | 7|. 
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continuous, even if the measure-preserving transformation T is mixing. A simple example can be 
constructed as follows. Let R = Rg be an irrational rotation of the circle S 1 , and let a : £ — * S 
be the shift on the space of all one-sided sequences u> = uj\lo 2 ■ ■ ■ with entries ±1. For x 6 S 1 and 
weS, define 

T(x,oj) = (R Ul x,auj); 

this is a mixing, measure-preserving transformation relative to A = Lebesgue x p, where p is the 
product Bernoulli-(l/2) measure on S. Now for x,y E S 1 and u, uo 1 e S, define 

K((x, w), (y, w')) = K(x, y) = l iiy-xe(9) 

= otherwise, 

where (9) denotes the (countable) subgroup of S 1 generated by 9. Clearly, K = almost surely 
relative to the product measure A x A, but K(T l x,T 3 x) — 1 along every orbit of T. Therefore, 
{19) fails. It is not difficult to modify the function K so that the limit fails to exist with probability 
one. □ 

3.2. SLLN for self-intersections. It is not much more difficult to prove the law of large numbers 
CD, by incorporating an additional observation from [9]. First, observe that Proposition [l] can be 
reformulated as a statement about weak convergence of empirical distributions: 

Corollary 3. Let (X, d) be a compact metric space, p, a Borel probability measure on X, and T : X — > X 
an ergodic, measure-preserving transformation relative to fx. For each x <E X and n > 1, let 

n n 

(20) rt = 3EEW.J 

i=l 3=1 

be the empirical measure that puts mass 1/n 2 at each point (T l x, T^x). Then for p,— almost every x € X, 

(21) pi n x pi. 

Consequently, ifUcXxXis any Borel measurable set whose topological boundary satisfies p x p(dU) = 
0, then for p— almost every x S X, 

n n 

(22) lim - VV lu{T l x, T^x) =px p(U). 

i=l j=l 

Proof. The assertion ET) is just an equivalent form of | fl9l l, by the definition of weak-* convergence. 
Given I121L the statement ll22l follows by elementary analysis (see, for instance, Billingsley's "Port- 
manteau Theorem", [1J, Theorem 2.1). □ 

Proof of SLLN {j}. Fix S > small, and consider the representation 10 for N(L) when L = nS. For 
ip = 1, the intersection kernel Hg(x, y) takes the form of an indicator function Hs(x, y) = lu(x, y), 
where U = Us is the set of pairs (x,y) such that the geodesic segments of length S with initial 
points x,y intersect transversally. It is easily checked (see |9J) that the boundary of this set has 
vl x v l — measure 0, so Corollary [3] implies that for almost every initial point, 

lim - T JV(7[0, nS\) = v L x u L (U s ). 

n — ^oo Ti 

By the same argument as in the proof of Lemma [TJ the constant on the right is ~ kmS 2 as S — * 0. 
Thus, the SLLN © follows. □ 
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4. Weak Convergence of Fluctuations 



4.1. Heuristics. We begin by using the results of sections I2.H - 42.21 to give a compelling — but 
non-rigorous — explanation of the weak convergence asserted in Theorem[TJ The Hilbert-Schmidt 
theorem asserts that a symmetric integral kernel in the class L 2 (vl x v£) has an L 2 — convergent 
eigenfunction expansion. The intersection kernel Hg(u, v) meets the requirements of this theorem, 
and so its eigenfunction expansion converges in L 2 {vl X v£): 

oo 

(23) Hg(u, v) = X k ^k(u)ipk(v). 

The L 2 — convergence of the series does not, of course, imply pointwise convergence. Nevertheless, 
let's proceed formally, ignoring convergence issues: Recall (CorollaryEJl that the eigenf unctions are 
mutually uncorrected, and so all except the constant eigenfunction ?/>i have mean zero relative to 
vl- Thus, the representation 10 of the intersection number N v (nS) can be rewritten as follows: 



n 71 

(24) N v (n5) - {n5) 2 K a = - J^fffM**)^*)) - (nS) 2 Kg 

i=i j=i 

1 n n oo 



2 

i=l j=l k=2 



oo / n > 



If the eigenfunctions ijij were Holder continuous, the central limit theorem for the geodesic flow 
|13[ would imply that for any finite K the joint distribution of the random vector 



(25) (-^=j2Mim 



2<k<K 



converges, as n — > oo, to a (possibly degenerate) K— variate Gaussian distribution centered at the 
origin. (The central limit theorem in [13J is stated only for the case K = 1, but the general case 
follows by standard weak convergence arguments [the "Cramer- Wold device"], as in (T), ch. 1.) 
Hence, for every K < oo the distribution of the truncated sum 

K / n \ 2 

(26) -£A fe (<5) 5> fe ( 7 (*<5)) 

k=2 \i=l / 

should converge, as n — > oo, to that of a quadratic form in the entries of the limiting Gaussian 
distribution. 



4 That the limit distribution has the same form as required by Defiiiition[T]follows by the spectral theorem for symmetric 
matrices and elementary properties of the multivariate Gaussian distribution, as follows. Suppose that the limit distribu- 
tion of the random vector 1251 is mean-zero Gaussian with (possibly degenerate) covariance matrix S; this distribution 
is the same as that of S X / 2 Z, where Z is a Gaussian random vector with mean zero and identity covariance matrix. Let 
A be the diagonal matrix with diagonal entries Aj(5). Then the limit distribution of {26) is identical to that of Z T MZ, 
where M = E 1 / 2 AS 1 / 2 . But the matrix M is symmetric, so it may be factored as M = U T DU, where U is an orthogonal 
matrix and D is diagonal. Now if Z is mean-zero Gaussian with the identity covariance matrix, then so is V Z, since U is 
orthogonal. Thus, Z T MZ has the same form as in Definition!!] where 9j are the diagonal entries of D. 
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There are, obviously, two problems with this argument. First, the central limit theorem requires 
that the functions tpj be Holder continuous; but since the intersection kernels Hs are not contin- 
uous, their eigenfunctions ipj will not be continuous either. This problem could be circumvented 
by smoothing. But second, the convergence of the infinite series in (J24j and the interchange of 
limits requires justification. The fact that the series 1 1231 converges in L 2 (vl x vjf) is of no use here, 
because for any s, t > the joint distribution of ('y(s), j(t)) is singular relative to x vjj. If the ker- 
nel Hs(u, v) were continuous and positive semi-definite, then Mercer's theorem (|4|, ch. 3) would 
imply pointwise — in fact, uniform — convergence of the series; unfortunately, neither Hg nor its 
smoothed version K$ is positive semi-definite. 

The way around these difficulties, it seems, is not to use the eigenfunction expansion, but in- 
stead to approximate, in C m — norm for suitable m, the smoothed kernel Ks(x,y) by elementary 
kernels, that is, finite sums of the form 

j j 

(27) h(x,y) —^^CLijipiix^jiy), 

i=l j=l 

where the functions ipj are C m , with mean zero relative to vl, and the matrix (dij) is symmetric. 
A repetition of the argument given above shows that for any such kernel h, 

T' 1 1 1 h{j{s),j{tj) dsdt G 

J J[0,T] 2 

where G is a (possibly degenerate) quadratic form in independent unit Gaussian random variables. 
Thus, to prove Theorem [T] it will suffice to show that the error incurred in approximating Kg by h 
in the integral 

T- 1 [[ K 5 ( 7 (s), 7 (t))dsdt 

JJ[0,T] 2 

is small uniformly for T > 1. In the remainder of this section we shall show that such estimates 
reduce to a problem of mixing for the geodesic flow. 

4.2. Multiple Mixing Rates for the Geodesic Flow. The proof of Theorem Q] will rely on an ex- 
tension of DOLGOPYAT's theorem (5) on exponential mixing rates for the geodesic flow. For the 
sake of simplicity, since our only interest is in the case of the Liouville measure vl, we will state 
the extension only for this measure. Denote by E expectation relative to Vl, and by jt = j(t) the 
geodesic ray whose initial direction vector 70 is randomly chosen according to vl- 

Theorem 3. For each K = 2,3, . . ., there exist constants C = Ck < co, A — A K > 0, and m = 
rtiK € N such that for any mean-zero, real-valued functions F±, F 2 , . . . , Fx of class C m on SM and all 
—00 = to < ti < t% < ■ ■ ■ < tx < tx+i — 00, 

(28) E Y[ < C (mm K m^(e- A ^-^,e- A ^-^)) f[ ||^|| m . 

Here \\F\\ m denotes the C m norm of the function F, that is, 

(29) ||F|| m := min sup \D n D. l2 ■ ■■ D ik F(x)\ 

k < m x£SM 

where the maximum is over all mixed partial derivatives of order < m in the directions of unit tangent 
vectors to SM. 
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Dolgopyat [5 J proves in the case K = 2 (for functions of differentiability class C 7 ) that for t > 0, 



(30) 



EF ( 7 (0))F 1 ( 7 (t)) 



^Ce-^llFollllFJ 



The inequality | [28|l is a natural extension of this, and can be proved in a similar fashion (using in 
addition the induction strategy in (6|). Because the arguments are so similar, we omit the details. 
(In fact, the argument below - see Lemma[9]- require only the case K = 4.) 

4.3. Approximation by Elementary Kernels. To make use of Theorem|3]we will approximate the 
centered kernels Kg (x, y) :— K$(x, y) — k v by elementary kernels, that is, kernels of the form l l27l l: 

J j 
i=i 3=1 

where the functions ipj are C m , with mean zero relative to vl, and the matrix (dj.j) is symmetric. 
Existence of such approximations (in particular, the requirement that the functions (fj have mean 
zero) depends crucially on the fact that the constant function 1 is an eigenfunction of Kg (Property 
(PI) above): this guarantees that the kernel Kg(x, y) is centered, that is, 

(31) J K* 5 (x, y) v L {dy) = J K(x, y) v L {dy) - k v = for every x G SM. 

Lemma 7. Let g(x,y) be a centered, symmetric kernel of class C m , where m > 0. Then for every e > 
there is an elementary kernel h(x, y) such that 

(32) \\g-h\\ m <e. 

Proof. This is completely standard except for the requirement that the component functions ipj in 
the approximating kernel h be mean-zero. Suppose that g is well-approximated by a kernel h of 
the form l(27|l in which the component functions ipj are not mean-zero. Denote by ipj the mean of 
<Pj relative to vl- Since g(x, y) integrates to against vl [dy) for every x, the inequality ((32b implies 
that 

j j j j 



< e, whence 



< £. 



i=l j=l i=l 3=1 

Consequently, g(x,y) is also well-approximated by the kernel 

J J 

h * {x, y) — ai >Wi ^ where <Pj ( x ) = W ( x ) - <Pj ■ 

i=l 3=1 



□ 



We will need a quantitative version of the approximation l|32l , in which the C m — norms of the 
terms a,ij<pj(x)<pi(y) are controlled by the size of the kernel g(x, y). This can be done, but at the 
cost of a more stringent differentiability requirement on g(x, y). 

Lemma 8. For every m > 1 there exists C < oo such that the following holds: For every centered, 
symmetric kernel g(x,y) of class C 2m , the elementary kernel h(x,y) in the approximation d3"2l can be 
chosen so that 



(33) 



kjl < C\\g\\ 2m (i + j)- 



and 



mi 



< i. 
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Proof. First, use a smooth partition of the identity to localize, then use Fourier series approxima- 
tions in the coordinate patches. The coordinate patches Ui in SM x SM can be chosen so that they 
are nearly isometric to cubes in M 6 ; using a matching partition of 1, we obtain a decomposition 



K 

k=l 



where each <?$ is supported by U{ and has C 2m — norm bounded by C"||<?||2m/ with a constant C 
independent of the kernel g. Each gi may now be viewed as a C 2m — function on a cube in M 6 , and 
as such can be expanded in a Fourier sine series. (This may have a nonzero constant term, but this 
will wash out later, since the original kernel g is centered.) Now sin kx, as a function on [0, 2n], has 
L — norm 1/^2, but has C m —norm k m . However, because gi is of class C 2m , its inner product with 
any product of sines can be integrated by parts up to 2m times, so the Fourier coefficient of any sine 
product containing a factor smkx will be bounded in magnitude by C" /k 2m . Thus, the resulting 
series approximation will satisfy j33l for suitable C < oo. Finally, the resulting approximations to 
g can be modified so that the component functions <fk are mean zero, by the same argument as in 
the proof of Lemma [7] □ 

4.4. L 2 Bounds via Approximation. Lemma[7]asserts that every centered, symmetric kernel g(x, y) 
of class C m can be arbitrarily well-approximated, in the C' m norm, by elementary kernels h(x, y). 
This leaves the problem of determining how much of an error might be incurred in replacing g by 
h in the integral 

(34) T- 1 [[ g( 7 (s),^(t))dsdt. 

J J [0,T] 2 

It is obvious that if ||<7 — h\\ m < e then the error cannot exceed eT. But this isn't good enough for 
our purposes: we need the error to be of size 0(1) for large T. This is where the multiple-mixing 
rate provided by Theorem [3] comes in. 

Lemma 9. For in e N sufficiently large there exist constants C = C m < oo such that if g(x,y) is a 
centered, symmetric kernel of class C 2m then for all T > 1, 

(35) E^JI^^g^isl^dsdt] <CT 2 \\g\\ 2 m . 

Proof. Let g(x, y) be a centered, symmetric kernel of class C 2m . By Lemma[7l for each T > 1 there 
exist an elementary kernel h = Ht such that g — h has C"™— norm - and therefore also sup norm - 
smaller than H^l^m/? 13 - Furthermore, by Lemma [8j the kernel h = Kt can be chosen so that it is of 
the form (TZTJl: 

h(x,y) = ^ y £2 i a i jtpi(x)y : j{y), 

i 3 

where the functions (pi are mean-zero with C m — norms bounded by 1, and the coefficients <ij „• 
satisfy 

KA^c'Wghm/ii+jr. 

Since the sup norm of g — h is smaller than \\g\\2m/T 3 , it follows that 

g( 7 (s), 7 (t))dsdt- ff h( 7 (s)^(t))dsdt <C"T- l \\g\\ 2m . 

[0,T] 2 JJ[0,T] 2 
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Consequently, it suffices to establish the inequality | |35|| with g replaced by h = h T . But because h 
is elementary, 



E (jJ [0T]2 h{l{s) ' l{t))dsdt ) 

= 4\^2^2^2^2a iul2 a i3ilA //// EY[(p ij (~f(sj))dsida 2 ds 3 ds4 
n t 2 l3 U J J J J0< Sl <s 2 <s 3 < Si <T J=1 

Now Theorem [3] applies: in particular, since each of the functions <pj has mean relative to Liou- 
ville measure, and since each has C m —norm no larger than 1, Theorem [3] implies that the inner 
expectation is bounded in magnitude by 

C'" min(cxp{-A(s 2 - si)}, exp{-A(si - S3)}). 

Thus, the quadruple integral is bounded by a constant multiple of T 2 . □ 

4.5. Weak Convergence for Centered Kernels. The approximation results of section 1431 together 
with the L 2 — bound provided by Lemma [9] together imply that the normalized integral ((34) has, 
for large T, a distribution close to that of a Gaussian quadratic form: 

Theorem 4. Let g(x, y) be a centered, symmetric kernel of class C°°. Then as T — > 00, 
(36) T- 1 [[ g( 7 (s)Mt))dsdt^F, 



i[0,T] 2 

where F is a probability distribution in the weak closure of the set of Gaussian quadratic forms. 

Proof. By Lemma [7l for each m > 1 and any e > there is an elementary function h(x, y) of class 
C m such that the difference r := g — h has C m — norm less than e. Since h is elementary, it is 
centered and symmetric; hence, so is the difference r. Therefore, by Lemma |9j if m is sufficiently 
large then for all T > 1, 

E ( T_1 // r( 1 (s),-f(t))dsdtj <Ce 

where C = C m < 00 is a constant depending only on the differentiability class C m . On the other 
hand, since h is elementary, it has the form 

J J 
i=i j=i 

that is, it is a finite, symmetric quadratic form in the functions <pj. Consequently, by the same 
argument as in section |4~T1 above , as T — -> 00, 

T' 1 1 1 &(7(«), 7(*)) dsdt -2+ G 



l[0,T] 2 

where G is a (possibly degenerate) quadratic form in independent unit Gaussian random variables. 
Since e > can be chosen arbitrarily small, it follows that for large T the random variable 

T- 1 !! g(j(s) )7 (t))dsdt 

J J [0,T] 2 
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is arbitrarily close in L 2 to a random variable with a distribution close to that of a Gaussian qua- 
dratic form. The theorem follows. □ 

Remark 4. Unfortunately, it seems, this argument gives no information about the limit distribution 
F other than the fact that it is a weak limit of distributions of Gaussian quadratic forms. The 
principal difficulty is in the use of the central limit theorem for the geodesic flow: Even if one 
knows, say, that the functions (fi in the elementary kernel h are uncorrelated under vl, it does not 
follow that the random variables 



i r 

vi Jo 



are uncorrelated, nor that their Gaussian limits will be uncorrelated. Thus, it seems that it is 
impossible to relate the coefficients in the limiting quadratic form to the coefficients dij in the 
expansion of the elementary approximations. 

4.6. Proof of Theorem[lJ Theorem|4]applies to any centered, symmetric kernel of class C°°, and so 
in particular to the kernels Kg defined by fl4) . Consequently, to complete the proof of Theorem [1] 
it suffices to show that the error in the inequalities flHI l is small (in L 1 (vl)) compared to t, for large 
t. Recall (Remark[2]| that the error in ft5|l is bounded by 

(37) N v fr[-5, t + S})~ N v fr[6, t-S]) = M v (rf[-S, S\,-y[-6, t + S}) 

+ M v (j[-8,t + 8],^[t-8,t + S]) 

where M^(a, (3) denotes the weighted sum of the transversal intersections between the geodesic 
segments a and (3, that is, 

where the sum is over all intersections i between a and (3, and 0j is the angle of the ith intersection. 
Hence, since the Liouville measure vl is invariant and reversible under the geodesic flow, the 
expected error in (Tl5t is bounded by 

2EA/ v (7[0,2<5], 7 [2(M + 46]). 

Thus, it suffices to show that this expectation is small compared to t when 8 is small. This is 
implied by the following lemma, which completes the proof of Theorem [T] 

Lemma 10. 

(38) limsupi" 1 EAf (p ( 7 [0,(5],7[0,t]) = 0. 

Proof. Recall that geodesic segments of length < q can intersect at most once, provided q > is 
sufficiently small. Hence, if 5 < q then M v (7 [0 , S] , 7 [0 , t] ) cannot be larger than 1 1 tp \ \ ^ t / q. Thus, the 
expectation in d37t is 0(t). The problem is to prove that the implied constant shrinks to as S — * 0. 
For this we will use the SLLN ® (proved in section[3]l and an averaging trick. The averaging trick 
is this: since M v is additive, 

ft rt + 8 

(39) 5' 1 M v ( 1 [s,s + S}, 1 [0,t})ds < 2N v (rf[0,t]) < (T 1 / M v {j[s, s + 5], 7 [0, *]) ds. 

Js=0 Js=-8 

This implies that AL(7[0,t]) is bounded above by || v^H ooi(t + 28) /(Sq), and so in particular the 
random variables N v (7 [0, t]) /t 2 are uniformly bounded. Therefore, the SLLN implies convergence 
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of expectations: 

(40) lim 2EN v { 1 [0,t])/t 2 = Kip . 

t — >oc 

Next, by additivity of M v and reversibility of the geodesic flow relative to the Liouville measure, 

EM v fr[s, s + S], 7 [0, t}) = EM v ( 7 [0, 6},j[0, t-s-S}) 
+ EM v (7[0,a],7[0,a + a]). 

Substituting this in (|39b yields 

2S- 1 / EA/ v ( 7 [0,<5],7[0,3])ds<2EiV v (7[0,t]). 

Js=S 

Since M v ( 7 [0, <5], 7 [0, s]) is nondecreasing in s, it follows that 

EM v ,( 7 [0,<5],7[0,t/2])/t < 2SEN v (-y[0,t])/t{t-S). 
The desired result ((38) now follows from the convergence of expectations (|40l . □ 

Remark 5. A more sophisticated argument, using the central limit theorem for the geodesic flow, 
shows that the errors in the inequality (fl5t are actually of order y/t. 

5. Central Limit Theorem for N v . f (t) 

5.1. Localized Intersection Kernels. In proving Theorem [2] there is no loss of generality in con- 
sidering only nonnegative functions /, so we shall assume throughout that / : M — > R is a non- 
negative, C°° function and that the smoothing function ip satisfies the hypotheses of Theorem 12 
When convenient, we will view / as a function on SM: that is, for any u = (x, 0) <= SM, set 
f{u) = f{x). Recall that the /—localized self-intersection counts N v -j(t) are obtained by summing 
f(xi)<p(9i), where x% and 9i are the locations and angles of the self-intersections, and the sum is 
over all self-intersections i of the geodesic segment 7 [0, t] . As for the global self-intersection counts, 
the localized counts can be represented by intersection kernels. As is the case for the global intersec- 
tion kernels, the obvious local kernels are not continuous, despite the fact that the functions (p and 
/ are both C°° . Consequently, we define smooth kernels k$ as in Section l23l 

(41) kg(u, v)=J2 5- 2 p{ Sl /5)p{U/8)I{x l )p{e i ) 

i 

where i, Sj, t i: and p are as in the definition l(l4l . By the same argument as in Lemma|5l if 5 > 
is sufficiently small then the sum (flTT l contains at most one nonzero term, and so kg extends to a 
symmetric, C°° function on SM x SM. In addition, by the same argument as in Lemma|6l 

(42) N;.jb[5,t-8\) < N v .j(j[5,t-5}) < N;. f (j[-5, t + 5}) 
where 

(43) N;. f ( 7 [a, b}) :=\j[ _ k s fr(r), j(s)) drds. 



a,b} 2 



Furthermore, by the same argument as in Section l4~6l the errors in the inequalities (|42l are of order 
0(t). Since the limit relation (O involves fluctuations of order i 3 / 2 , it follows that the errors in 
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i can be ignored in proving Theorem [2] Thus, it suffices to prove, for some small 6 > 0, that as 

t — > oo, 



,44) 5*^=44 

Corollary 4. (SLLN) Define 2 As — K v Efs where the expectation is with respect to the Liouville measure. 
Then for all sufficiently small 5 > 0, 

N*. f N . f 

(45) Ax = Aus-t and lim — = lim — = A^.t a.s. 

t^oo t 2 t^oo t 2 

Proof. Proposition Q] implies that N*.j(t)/t 2 converges almost surely to Ag. But the inequalities 
((42) and the fact that the errors are of order 0(t) imply that the limit constant does not depend on 

S. ' □ 

5.2. Lead Eigenfunction. The primary difference between the global intersection kernel Ks and 
the local kernel ks is that the constant function 1 is not, in general, an eigenfunction of the local 
kernel. To see this, define 

(46) f s (u):=—[ k s (u,v)v L (dv). 



/4 C - 



<p J SM 



Lemma 11. 



HmH/j-Zlloo =0. 



Proof. Because kg is non-zero only for pairs u, v at distance < 25, the value of / in pT) will be 
close to f(u) for all small 5, uniformly for u 6 SM. (Here we are viewing / as a function on SM.) 
Hence, 

\f(u)K s (u,v)~ks(u,v)\ < max |/(o) - /(6)|. 

d(a,b)<8 

Since K$l = K v ,by (PI) of section l2~3l the result follows from the continuity of /. □ 

5.3. Cohomology and the Central Limit Theorem. By Lemma ITT1 if / is not cohomologous to a 
constant, then neither is fs provided 5 > is sufficiently small. This follows trivially from the 
definition: 

Definition 3. A C°° function g : SM — > R (or C) is said to be a coboundary relative to the geodesic 
flow if it integrates to zero along every closed geodesic; similarly, g is said to be cohomologous to a 
constant a if g — a is a coboundary. 

It is not so easy to find nonconstant functions that are cohomologous to constants, but it is quite 
easy to construct a function g that is not cohomologous to a constant: Take two closed geodesies 
a and /3 that do not intersect on M, and let g : M — > M be any C°°, nonnegative function that is 
identically 1 along a but vanishes in a neighborhood of (3. In fact, the existence of non-intersecting 



''That there are non-intersecting closed geodesies on any compact surface of constant negative curvature follows from 
the standard representation of such a surface as a geodesic polygon in the hyperbolic plane with sides identified, because 
non-crossing sides will be non-intersecting closed geodesies after the identifications are made. It then follows that there 
are non-intersecting closed geodesies on any compact surface of variable negative curvature, because all variable-negative- 
curvature metrics on a compact surface can be obtained by smooth deformation of the constant curvature metric, and such 
deformations preserve transversal intersections of closed geodesies. For more detail, see, for instance, ] 8 [. 
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closed geodesies yields the existence of a large class of functions that are not cohomologous to 
constants: 

Proposition 2. Let e > be the distance in M between two non-intersecting closed geodesies a and (3. 
Then no C°°, nonnegative, function g : M — ► R that is not identically zero and whose support has diameter 
less than e is cohomologous to a constant. 

Proof. By hypothesis, g vanishes on at least one of the geodesies a, (3. Because closed geodesies are 
dense in SM, their projections are dense in M. Thus, since g is not identically 0, there is a closed 
geodesic £ on which the average value of g is positive. □ 

The importance of the concept of cohomology to us is its relation to the central limit theorem 
for the geodesic flow: If g is not cohomologous to a constant, then there exist^ a = a g > such 
that if 7(0) is chosen at random from the Liouville measure, then 



1 



(47) — / . 9 ( 7 ( s )) da- g dv L $. 

cyr Uo Jsm J 

5.4. Proof of TheoremHJ Rewrite the relation d43b as 

(48) 2N* f ( ri [Q,t}) = 2K v [[ fs(Mr))drds~2A s t 2 + ff fc a ( 7 (r), 7 (s)) drds 

JJ[o,t] 2 JJ[o,t] 2 

where 

ks(u,v) = kg(u, v) - K v (fs{u) + fs(v)) + As. 
Corollary S] implies that As = A v -j for all small 8 > 0. By Proposition |2j the function / is not 
cohomologous to a constant, and so by Lemma [TT] neither is fg, provided 5 > is sufficiently 
small. Consequently, the fluctuations of the first integral in j48"t are of order i 3 / 2 , and the central 
limit theorem implies that for a = tjj.s > 0, 

a 



Tt 3/2 



fs{l{r))drds-A v ., s t z } =^ $. 
[CM] 2 I 



On the other hand, since the kernel ks is C°°, symmetric, and centered, the fluctuations of the 
second integral in J48b are of order t, by TheoremH] Theorem [2] follows. □ 

Acknowledgment. Thanks to Jayadev Athreya for a helpful discussion. 
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